RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling
Authors
Abstract
Beyond the success in classification, neural networks have recently shown strong results on pixel-wise prediction tasks like image semantic segmentation on RGBD data. However, the commonly used deconvolutional layers for upsampling intermediate representations to the full-resolution output still show different failure modes, like imprecise segmentation boundaries and label mistakes, in particular on large, weakly textured objects (e.g. fridge, whiteboard, door). We attribute these errors in part to the rigid way in which current networks aggregate information, which can be either too local (missing context) or too global (inaccurate boundaries). Therefore we propose a data-driven pooling layer that integrates with fully convolutional architectures and utilizes boundary detection from RGBD image segmentation approaches. We extend our approach to leverage region-level correspondences across images with an additional temporal pooling stage. We evaluate our approach on the NYU-Depth-V2 dataset, which consists of indoor RGBD video sequences, and compare it to various state-of-the-art baselines. Besides a general improvement over the state-of-the-art, our approach shows particularly good results in terms of accuracy of the predicted boundaries and in segmenting previously problematic classes.
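To make the idea of pooling over data-driven regions concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation. It assumes a per-pixel feature or score map and an integer region map (for example, from some RGBD segmentation method) are already available. The function name `region_average_pool` and both inputs are hypothetical and chosen only for illustration. The sketch averages features inside each region and writes the mean back to every pixel of that region, so the pooled output follows region boundaries rather than a fixed grid.

```python
import numpy as np


def region_average_pool(features, regions):
    """Average features within each region and broadcast the mean back.

    features: float array of shape (H, W, C), e.g. per-pixel class scores.
    regions:  int array of shape (H, W) with region ids in 0..R-1
              (assumed to come from an external segmentation step).
    Returns an array of shape (H, W, C).
    """
    h, w, c = features.shape
    flat_feat = features.reshape(-1, c)
    flat_reg = regions.reshape(-1)
    n_regions = int(flat_reg.max()) + 1

    # Accumulate per-region feature sums (unbuffered add handles repeated ids).
    sums = np.zeros((n_regions, c), dtype=np.float64)
    np.add.at(sums, flat_reg, flat_feat)

    # Number of pixels per region; guard against empty region ids.
    counts = np.bincount(flat_reg, minlength=n_regions).astype(np.float64)
    means = sums / np.maximum(counts, 1.0)[:, None]

    # Every pixel receives the mean feature of its region.
    return means[flat_reg].reshape(h, w, c)
```

Under the same assumptions, a temporal stage could be approximated by giving corresponding regions in neighbouring frames the same region id and pooling over the stacked frames. How the paper actually establishes and uses those correspondences is not specified in the abstract.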
Similar resources
STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling Supplementary Material
In the supplementary material, we present the analysis of semantic boundary accuracy in Section 1. In Section 2, we evaluate the oracle performance on the NYUDv2 40-class task with our spatio-temporal data-driven pooling. In Section 3, we analyze the ground-truth annotations of the NYUDv2 40-class task. In Section 4, we provide the qualitative results of the semantic segmentation results of the NYUDv...
STFCN: Spatio-Temporal FCN for Semantic Video Segmentation
This paper presents a novel method to involve both spatial and temporal features for semantic segmentation of street scenes. Current work on convolutional neural networks (CNNs) has shown that CNNs provide advanced spatial features supporting a very good performance of solutions for the semantic segmentation task. We investigate how involving temporal features also has a good effect on segmenti...
Temporal Semantic Motion Segmentation Using Spatio Temporal Optimization
Segmenting moving objects in a video sequence has been a challenging problem and critical to outdoor robotic navigation. While recent literature has laid focus on regularizing object labels over a sequence of frames, exploiting the spatio-temporal features for motion segmentation has been scarce. Particularly in real world dynamic scenes, existing approaches fail to exploit temporal consistency...
A Method for Modeling and Segmentation of Spatio-Temporal Shapes
This paper presents a method for modeling and segmenting spatio-temporal shapes. The modeling part is based on obtaining a description of the statistical variations of spatio-temporal shape parameters by studying a representative training set of examples. A deformable model of spatio-temporal shapes is used for segmenting similar shapes in new image sequences. The deformations of the model are ...
A Hybrid Model and Computing Platform for Spatio-semantic Trajectories
Spatio-temporal data management has progressed significantly towards efficient storage and indexing of mobility data. Typically such mobility data analytics is assumed to follow the model of a stream of (x,y,t) points, usually coming from GPS-enabled mobile devices. With large-scale adoption of GPS-driven systems in several application sectors (shipment tracking to geo-social networks), there i...
Journal title:
- CoRR
Volume: abs/1604.02388
Publication date: 2016